Goto

Collaborating Authors

 Miami Gardens


Synthetic Multimodal Question Generation

arXiv.org Artificial Intelligence

Multimodal Retrieval Augmented Generation (MMRAG) is a powerful approach to questionanswering over multimodal documents. A key challenge with evaluating MMRAG is the paucity of high-quality datasets matching the question styles and modalities of interest. In light of this, we propose SMMQG, a synthetic data generation framework. SMMQG leverages interplay between a retriever, large language model (LLM) and large multimodal model (LMM) to generate question and answer pairs directly from multimodal documents, with the questions conforming to specified styles and modalities. We use SMMQG to generate an MMRAG dataset of 1024 questions Figure 1: An overview of SMMQG. Given userprovided over Wikipedia documents and evaluate stateof-the-art question style and modality requirements, SMmodels using it, revealing insights MQG selects question sources and produces questions into model performance that are attainable only and answers. The questions are grounded in the selected through style-and modality-specific evaluation question sources, and adhere to the question and modality data. Next, we measure the quality of data produced requirements.


PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models

arXiv.org Artificial Intelligence

Large language models (LLMs) have achieved remarkable success due to their exceptional generative capabilities. Despite their success, they also have inherent limitations such as a lack of up-to-date knowledge and hallucination. Retrieval-Augmented Generation (RAG) is a state-of-the-art technique to mitigate those limitations. In particular, given a question, RAG retrieves relevant knowledge from a knowledge database to augment the input of the LLM. For instance, the retrieved knowledge could be a set of top-k texts that are most semantically similar to the given question when the knowledge database contains millions of texts collected from Wikipedia. As a result, the LLM could utilize the retrieved knowledge as the context to generate an answer for the given question. Existing studies mainly focus on improving the accuracy or efficiency of RAG, leaving its security largely unexplored. We aim to bridge the gap in this work. Particularly, we propose PoisonedRAG , a set of knowledge poisoning attacks to RAG, where an attacker could inject a few poisoned texts into the knowledge database such that the LLM generates an attacker-chosen target answer for an attacker-chosen target question. We formulate knowledge poisoning attacks as an optimization problem, whose solution is a set of poisoned texts. Depending on the background knowledge (e.g., black-box and white-box settings) of an attacker on the RAG, we propose two solutions to solve the optimization problem, respectively. Our results on multiple benchmark datasets and LLMs show our attacks could achieve 90% attack success rates when injecting 5 poisoned texts for each target question into a database with millions of texts. We also evaluate recent defenses and our results show they are insufficient to defend against our attacks, highlighting the need for new defenses.


Here Are the Stadiums That Are Keeping Track of Your Face

Slate

"Your face is your ticket," goes the motto of A.I. startup Wicket. "Your face is your credential," says Alcatraz AI, another vendor. Both these companies sell facial recognition technology to sports stadiums across the country. Citi Field, home of the Mets, contracted with Wicket in 2022 to add facial recognition ticket kiosks to all stadium gates. BMO Stadium, home of the Los Angeles Football Club, began using Alcatraz AI technology the year before.


Tua Tagovailoa 'not afraid' of Super Bowl talk after NFL trade deadline: 'Full belief that we are capable'

FOX News

Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. The Miami Dolphins have been revamping their offense under first-year head coach Mike McDaniel. On Tuesday, they made two deals that have many believing this could be a Super Bowl contending team. Tua Tagovailoa is one of the believers.


WWE releases 2022 pay-per-view schedule

FOX News

Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. WWE is ready for 2022. The pro wrestling company released its pay-per-view schedule for the next year with two more shows left on the docket for the year, Survivor Series in November and TLC: Tables Ladders & Chairs in December. MIAMI GARDENS, FL - APRIL 1: John Cena looks on before his match against Dwayne ''The Rock'' Johnson during WrestleMania XXVIII at Sun Life Stadium on April 1, 2012 in Miami Gardens, Florida.